User interface language: English | Español

HL Paper 3

This question is about modelling the spread of a computer virus to predict the number of computers in a city which will be infected by the virus.


A systems analyst defines the following variables in a model:

The following data were collected:

A model for the early stage of the spread of the computer virus suggests that

Q't=βNQt

where N is the total number of computers in a city and β is a measure of how easily the virus is spreading between computers. Both N and β are assumed to be constant.

The data above are taken from city X which is estimated to have 2.6 million computers.
The analyst looks at data for another city, Y. These data indicate a value of β=9.64×108.

An estimate for Q(t), t5, can be found by using the formula:

Q'tQt+5-Qt-510.

The following table shows estimates of Q'(t) for city X at different values of t.

An improved model for Q(t), which is valid for large values of t, is the logistic differential equation

Q't=kQt1-QtL

where k and L are constants.

Based on this differential equation, the graph of Q'tQt against Q(t) is predicted to be a straight line.

Find the equation of the regression line of Q(t) on t.

[2]
a.i.

Write down the value of r, Pearson’s product-moment correlation coefficient.

[1]
a.ii.

Explain why it would not be appropriate to conduct a hypothesis test on the value of r found in (a)(ii).

[1]
a.iii.

Find the general solution of the differential equation Q't=βNQt.

[4]
b.i.

Using the data in the table write down the equation for an appropriate non-linear regression model.

[2]
b.ii.

Write down the value of R2 for this model.

[1]
b.iii.

Hence comment on the suitability of the model from (b)(ii) in comparison with the linear model found in part (a).

[2]
b.iv.

By considering large values of t write down one criticism of the model found in (b)(ii).

[1]
b.v.

Use your answer from part (b)(ii) to estimate the time taken for the number of infected computers to double.

[2]
c.

Find in which city, X or Y, the computer virus is spreading more easily. Justify your answer using your results from part (b).

[3]
d.

Determine the value of a and of b. Give your answers correct to one decimal place.

[2]
e.

Use linear regression to estimate the value of k and of L.

[5]
f.i.

The solution to the differential equation is given by

Qt=L1+Ce-kt

where C is a constant.

Using your answer to part (f)(i), estimate the percentage of computers in city X that are expected to have been infected by the virus over a long period of time.

[2]
f.ii.

Markscheme

Q(t)=3090t-54000  3094.27t-54042.3         A1A1


Note: Award at most A1A0 if answer is not an equation. Award A1A0 for an answer including either x or y.

 

[2 marks]

a.i.

0.755  0.754741         A1

 

[1 mark]

a.ii.

t is not a random variable OR it is not a (bivariate) normal distribution

OR data is not a sample from a population

OR data appears nonlinear

OR r only measures linear correlation         R1

 

Note: Do not accept “r is not large enough”.

 

[1 mark]

a.iii.

attempt to separate variables            (M1)

1QdQ=βNdt

lnQ=βNt+c           A1A1A1 

 

Note: Award A1 for LHS, A1 for βNt, and A1 for +c.

Award full marks for Q=eβNt+c  OR  Q=AeβNt.

Award M1A1A1A0 for Q=eβNt

 

[4 marks]

b.i.

attempt at exponential regression           (M1)

Q=1.15e0.292t  Q=1.14864e0.292055t           A1

OR

attempt at exponential regression           (M1)

Q=1.15×1.34t  1.14864×1.33917t           A1

 

Note: Condone answers involving y or x. Condone absence of “Q=” Award M1A0 for an incorrect answer in correct format.

 

[2 marks]

b.ii.

0.999  0.999431          A1

 

[1 mark]

b.iii.

comparing something to do with R2 and something to do with r        M1

 

Note:   Examples of where the M1 should be awarded:

R2>r
R>r
0.999>0.755
0.999>0.7552   =0.563
The “correlation coefficient” in the exponential model is larger.
Model B has a larger R2

Examples of where the M1 should not be awarded:

The exponential model shows better correlation (since not clear how it is being measured)
Model 2 has a better fit
Model 2 is more correlated

 

an unambiguous comparison between R2 and r2 or R and r leading to the conclusion that the model in part (b) is more suitable / better          A1

 

Note: Condone candidates claiming that R is the “correlation coefficient” for the non-linear model.

 

[2 marks]

b.iv.

it suggests that there will be more infected computers than the entire population       R1

 

Note: Accept any response that recognizes unlimited growth. 

 

[1 mark]

b.v.

1.15e0.292t=2.3  OR  1.15×1.34t=2.3  OR  t=ln20.292  OR using the model to find two specific times with values of Qt which double          M1

t=2.37  (days)          A1

 

Note: Do not FT from a model which is not exponential. Award M0A0 for an answer of 2.13 which comes from using (10, 20) from the data or any other answer which finds a doubling time from figures given in the table.

 

[2 marks]

c.

an attempt to calculate β for city X          (M1)


β=0.2920552.6×106  OR  β=ln1.339172.6×106

=1.12328×10-7          A1

this is larger than 9.64×10-8 so the virus spreads more easily in city X         R1

 

Note: It is possible to award M1A0R1.
Condone “so the virus spreads faster in city X” for the final R1.

 

[3 marks]

d.

a=38.3, b=3086.1          A1A1

 

Note: Award A1A0 if values are correct but not to 1 dp.

 

[2 marks]

e.

Q'Q=0.42228-2.5561×10-6Q          (A1)(A1)


Note:
Award A1 for each coefficient seen – not necessarily in the equation. Do not penalize seeing in the context of y and x.


identifying that the constant is k OR that the gradient is -kL          (M1)

therefore k=0.422   0.422228          A1

kL=2.5561×10-6

L=165000   165205          A1


Note:
Accept a value of L of 164843 from use of 3 sf value of k, or any other value from plausible pre-rounding.
Allow follow-through within the question part, from the equation of their line to the final two A1 marks.

 

[5 marks]

f.i.

recognizing that their L is the eventual number of infected        (M1)

1652052600000=6.35%    6.35403%          A1


Note:
Accept any final answer consistent with their answer to part (f)(i) unless their L is less than 120146 in which case award at most M1A0.

 

[2 marks]

f.ii.

Examiners report

A significant minority were unable to attempt 1(a) which suggests poor preparation for the use of the GDC in this statistics-heavy course. Large numbers of candidates appeared to use y, x, Q and t interchangeably. Accurate use of notation is an important skill which needs to be developed.

1(a)(iii) was a question at the heart of the Applications and interpretations course. In modern statistics many of the calculations are done by a computer so the skill of the modern statistician lies in knowing which tests are appropriate and how to interpret the results. Very few candidates seemed familiar with the assumptions required for the use of the standard test on the correlation coefficient. Indeed, many candidates answered this by claiming that the value was either too large or too small to do a hypothesis test, indicating a major misunderstanding of the purpose of hypothesis tests.

1(b)(i) was done very poorly. It seems that perhaps adding parameters to the equation confused many candidates – if the equation had been Q'(t)=5Q(t) many more would have successfully attempted this. However, the presence of parameters is a fundamental part of mathematical modelling so candidates should practise working with expressions involving them.

1(b)(ii) and (iii) were done relatively well, with many candidates using the data to recognize an exponential model was a good idea. Part (iv) was often communicated poorly. Many candidates might have done the right thing in their heads but just writing that the correlation was better did not show which figures were being compared. Many candidates who did write down the numbers made it clear that they were comparing an R2 value with an r value.

1(c) was not meant to be such a hard question. There is a standard formula for half-life which candidates were expected to adapt. However, large numbers of candidates conflated the data and the model, finding the time for one of the data points (which did not lie on the model curve) to double. Candidates also thought that the value of t found was equivalent to the doubling time, often giving answers of around 40 days which should have been obviously wrong.

1(d) was quite tough. Several candidates realized that β was the required quantity to be compared but very few could calculate β for city X using the given information.

1(e) was meant to be relatively straightforward but many candidates were unable to interpret the notation given to do the quite straightforward calculation.

1(f) was meant to be a more unusual problem-solving question getting candidates to think about ways of linearizing a non-linear problem. This proved too much for nearly all candidates.

a.i.
[N/A]
a.ii.
[N/A]
a.iii.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
b.iii.
[N/A]
b.iv.
[N/A]
b.v.
[N/A]
c.
[N/A]
d.
[N/A]
e.
[N/A]
f.i.
[N/A]
f.ii.



In this question you will explore possible models for the spread of an infectious disease

An infectious disease has begun spreading in a country. The National Disease Control Centre (NDCC) has compiled the following data after receiving alerts from hospitals.

A graph of n against d  is shown below.

The NDCC want to find a model to predict the total number of people infected, so they can plan for medicine and hospital facilities. After looking at the data, they think an exponential function in the form n = a b d could be used as a model.

Use your answer to part (a) to predict

The NDCC want to verify the accuracy of these predictions. They decide to perform a χ 2 goodness of fit test.

The predictions given by the model for the first five days are shown in the table.

In fact, the first day when the total number of people infected is greater than 1000 is day 14, when a total of 1015 people are infected.

Based on this new data, the NDCC decide to try a logistic model in the form  n = L 1 + c e k d .

Use the data from days 1–5, together with day 14, to find the value of

Use an exponential regression to find the value of a  and of b , correct to 4 decimal places.

[3]
a.

the number of new people infected on day 6.

[3]
b.i.

the day when the total number of people infected will be greater than 1000.

[2]
b.ii.

Use your answer to part (a) to show that the model predicts 16.7 people will be infected on the first day.

[1]
c.

Explain why the number of degrees of freedom is 2.

[2]
d.i.

Perform a χ 2 goodness of fit test at the 5% significance level. You should clearly state your hypotheses, the p-value, and your conclusion.

[5]
d.ii.

Give two reasons why the prediction in part (b)(ii) might be lower than 14.

[2]
e.

L .

[2]
f.i.

c .

[1]
f.ii.

k .

[1]
f.iii.

Hence predict the total number of people infected by this disease after several months.

[2]
g.

Use the logistic model to find the day when the rate of increase of people infected is greatest.

[3]
h.

Markscheme

a = 9.7782 , b = 1.7125      M1A1A1

[3 marks]

a.

n ( 6 ) = 247        A1

number of new people infected = 247 – 140 = 107     M1A1

[3 marks]

b.i.

use of graph or table      M1

day 9    A1

[2 marks]

b.ii.

9.7782(1.7125)1      M1

= 16.7 people    AG

[1 mark]

c.

2 parameters ( a and b ) were estimated from the data.     R1

υ = 5 1 2      M1

= 2    AG

[2 marks]

d.i.

H 0 :  data is modeled by  n ( d ) = 9.7782 ( 1.7125 ) d and  H 1 :  data is not modeled by  n ( d ) = 9.7782 ( 1.7125 ) d      A1

p-value = 0.893    A2

Since 0.893 > 0.05     R1

Insufficient evidence to reject H 0 . So data is modeled by  n ( d ) = 9.7782 ( 1.7125 ) d     A1

[5 marks]

d.ii.

vaccine or medicine might slow down rate of infection     R1

People become more aware of disease and take precautions to avoid infection     R1

Accept other valid reasons

[2 marks]

e.

1060      M1A1

[2 marks]

f.i.

108      A1

[1 mark]

f.ii.

0.560     A1

[1 mark]

f.iii.

As  d        M1

n 1060       A1

[2 marks]

g.

sketch of  d n d d or solve  d 2 n d d 2 = 0         M1

d = 8.36       A1

Day 8      A1

[3 marks]

h.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
e.
[N/A]
f.i.
[N/A]
f.ii.
[N/A]
f.iii.
[N/A]
g.
[N/A]
h.



An estate manager is responsible for stocking a small lake with fish. He begins by introducing 1000 fish into the lake and monitors their population growth to determine the likely carrying capacity of the lake.

After one year an accurate assessment of the number of fish in the lake is taken and it is found to be 1200.

Let N be the number of fish t years after the fish have been introduced to the lake.

Initially it is assumed that the rate of increase of N will be constant.

When t=8 the estate manager again decides to estimate the number of fish in the lake. To do this he first catches 300 fish and marks them, so they can be recognized if caught again. These fish are then released back into the lake. A few days later he catches another 300 fish, releasing each fish after it has been checked, and finds 45 of them are marked.

Let X be the number of marked fish caught in the second sample, where X is considered to be distributed as Bn,p. Assume the number of fish in the lake is 2000.

The estate manager decides that he needs bounds for the total number of fish in the lake.

The estate manager feels confident that the proportion of marked fish in the lake will be within 1.5 standard deviations of the proportion of marked fish in the sample and decides these will form the upper and lower bounds of his estimate.

The estate manager now believes the population of fish will follow the logistic model Nt=L1+Ce-kt where L is the carrying capacity and C,k>0.

The estate manager would like to know if the population of fish in the lake will eventually reach 5000.

Use this model to predict the number of fish in the lake when t=8.

[2]
a.

Assuming the proportion of marked fish in the second sample is equal to the proportion of marked fish in the lake, show that the estate manager will estimate there are now 2000 fish in the lake.

[2]
b.

Write down the value of n and the value of p.

[2]
c.i.

State an assumption that is being made for X to be considered as following a binomial distribution.

[1]
c.ii.

Show that an estimate for Var(X) is 38.25.

[2]
d.i.

Hence show that the variance of the proportion of marked fish in the sample, VarX300, is 0.000425.

[2]
d.ii.

Taking the value for the variance given in (d) (ii) as a good approximation for the true variance, find the upper and lower bounds for the proportion of marked fish in the lake.

[2]
e.i.

Hence find upper and lower bounds for the number of fish in the lake when t=8.

[2]
e.ii.

Given this result, comment on the validity of the linear model used in part (a).

[2]
f.

Assuming a carrying capacity of 5000 use the given values of N0 and N1 to calculate the parameters C and k.

[5]
g.i.

Use these parameters to calculate the value of N8 predicted by this model.

[2]
g.ii.

Comment on the likelihood of the fish population reaching 5000.

[2]
h.

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.

N8=1000+200×8        M1

=2600        A1

 

[2 marks]

a.

45300=300N        M1A1

N=2000        AG

 

[2 marks]

b.

n=300, p=3002000=0.15     A1A1

 

[2 marks]

c.i.

Any valid reason for example:          R1

Marked fish are randomly distributed, so p constant.

Each fish caught is independent of previous fish caught

 

[1 mark]

c.ii.

Var(X)=np1-p          M1

=300×3002000×17002000          A1

=38.25          AG

 

[2 marks]

d.i.

VarX300=Var(X)3002          M1A1

=0.000425             AG

 

[2 marks]

d.ii.

0.15±1.50.000425          (M1)

0.181 and 0.119             A1

 

[2 marks]

e.i.

300N=0.181,  300N=0.119            M1

Lower bound 1658 upper bound 2519             A1

 

[2 marks]

e.ii.

Linear model prediction falls outside this range so unlikely to be a good model            R1A1

 

[2 marks]

f.

1000=50001+C            M1

C=4            A1

1200=50001+4e-k            M1

e-k=38004×1200            (M1)

k=-ln0.7916=0.2336         A1

 

[5 marks]

g.i.

N8=50001+4e-0.2336×8=3090         M1A1

 

Note: Accept any answer that rounds to 3000.

 

[2 marks]

g.ii.

This is much higher than the calculated upper bound for N(8) so the rate of growth of the fish is unlikely to be sufficient to reach a carrying capacity of 5000.        M1R1

 

[2 marks]

h.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.i.
[N/A]
c.ii.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
e.i.
[N/A]
e.ii.
[N/A]
f.
[N/A]
g.i.
[N/A]
g.ii.
[N/A]
h.



Consider the functions f g R × R R × R  defined by

f ( ( x , y ) ) = ( x + y , x y ) and  g ( ( x , y ) ) = ( x y , x + y ) .

Find  ( f g ) ( ( x , y ) ) .

[3]
a.i.

Find ( g f ) ( ( x , y ) ) .

[2]
a.ii.

State with a reason whether or not f and g commute.

[1]
b.

Find the inverse of  f .

[3]
c.

Markscheme

( f g ) ( ( x , y ) ) = f ( g ( ( x , y ) ) )   ( = f ( ( x y , x + y ) ) )       (M1)

= ( x y + x + y , x y x y )        A1A1

 

[3 marks]

a.i.

( g f ) ( ( x , y ) ) = g ( f ( ( x , y ) ) )

= g ( ( x + y , x y ) )

= ( ( x + y ) ( x y ) , x + y + x y )

= ( x 2 y 2 , 2 x )        A1A1

 

[2 marks]

a.ii.

no because  f g g f         R1

Note: Accept counter example.

 

[1 mark]

b.

 

f ( ( x , y ) ) = ( a , b ) ( x + y , x y ) = ( a , b )        (M1)

{ x = a + b 2 y = a b 2        (M1)

f 1 ( ( x , y ) ) = ( x + y 2 , x y 2 )         A1

 

[3 marks]

c.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.
[N/A]
c.



A suitable site for the landing of a spacecraft on the planet Mars is identified at a point, A. The shortest time from sunrise to sunset at point A must be found.

Radians should be used throughout this question. All values given in the question should be treated as exact.

Mars completes a full orbit of the Sun in 669 Martian days, which is one Martian year.

On day t, where t , the length of time, in hours, from the start of the Martian day until sunrise at point A can be modelled by a function, R(t), where

R(t)=asinbt+c, t.

The graph of R is shown for one Martian year.

Mars completes a full rotation on its axis in 24 hours and 40 minutes.

The time of sunrise on Mars depends on the angle, δ, at which it tilts towards the Sun. During a Martian year, δ varies from 0.440 to 0.440 radians.

The angle, ω, through which Mars rotates on its axis from the start of a Martian day to the moment of sunrise, at point A, is given by cosω=0.839tanδ, 0ωπ.

Use your answers to parts (b) and (c) to find

Let S(t) be the length of time, in hours, from the start of the Martian day until sunset at point A on day t. S(t) can be modelled by the function

S(t)=1.5sin(0.00939t+2.83)+18.65.

The length of time between sunrise and sunset at point A, L(t), can be modelled by the function

L(t)=1.5sin(0.00939t+2.83)1.6sin(0.00939t)+d.

Let f(t)=1.5sin(0.00939t+2.83)1.6sin(0.00939t) and hence L(t)=f(t)+d.

f(t) can be written in the form Im(z1z2) , where z1 and z2 are complex functions of t.

Show that b0.00939.

[2]
a.

Find the angle through which Mars rotates on its axis each hour.

[3]
b.

Show that the maximum value of ω=1.98, correct to three significant figures.

[3]
c.i.

Find the minimum value of ω.

[1]
c.ii.

the maximum value of R(t).

[2]
d.i.

the minimum value of R(t).

[1]
d.ii.

Hence show that a=1.6, correct to two significant figures.

[2]
e.

Find the value of c.

[2]
f.

Find the value of d.

[2]
g.

Write down z1 and z2 in exponential form, with a constant modulus.

[3]
h.i.

Hence or otherwise find an equation for L in the form L(t)=psin(qt+r)+d, where p, q, r, d.

[4]
h.ii.

Find, in hours, the shortest time from sunrise to sunset at point A that is predicted by this model.

[2]
h.iii.

Markscheme

recognition that period =669                  (M1)

b=2π669  OR  b=0.00939190                A1


Note:
Award A1 for a correct expression leading to the given value or for a correct value of b to 4 sf or greater accuracy.


b0.00939             AG


[2 marks]

a.

length of day=2423 hours                (A1)


Note: Award A1 for 23, 0.666, 0.6¯ or 0.667.


2π2423               (M1)


Note: Accept 3602423.


=0.255 radians 0.254723, 3π37, 14.5945°               A1

[3 marks]

b.

substitution of either value of δ into equation              (M1)

correct use of arccos to find a value for ω              (M1)


Note: Both (M1) lines may be seen in either part (c)(i) or part (c)(ii).


cosω=0.839tan-0.440               A1

ω=1.97684

1.98               AG


Note: For substitution of 1.98 award M0A0.

 

[3 marks]

c.i.

δ=0.440

ω=1.16  (1.16474)            A1

 

[1 mark]

c.ii.

Rmax=1.976840.25472           (M1)

=7.76 hours  (7.76075)            A1


Note: Accept 7.70 from use of 1.98.

[2 marks]

d.i.

Rmin=1.164740.25472

=4.57 hours  (4.57258)            A1


Note: Accept 4.55 and 4.56 from use of rounded values.

[1 mark]

d.ii.

a=7.76075-4.572582           M1

1.59408           A1


Note: Award M1 for substituting their values into a correct expression. Award A1 for a correct value of a from their expression which has at least 3 significant figures and rounds correctly to 1.6.

1.6 (correct to 2 sf)          AG

 

[2 marks]

e.

EITHER

c=7.76075+4.572582 =12.3332           (M1)


OR

c=4.57258+1.59408  or  c=7.76075-1.59408


THEN

=6.17  6.16666           A1


Note: Accept 6.16 from use of rounded values. Follow through on their answers to part (d) and 1.6.


[2 marks]

f.

d=18.65-6.16666           (M1)

=12.5  12.4833           A1


Note: Follow through for 18.65 minus their answer to part (f).


[2 marks]

g.

at least one expression in the form regti           (M1)

z1=1.5e0.00939t+2.83i,  z2=1.6e0.00939ti           A1A1


[3 marks]

h.i.

EITHER

z1-z2=1.5e0.00939t+2.83i-1.6e0.00939ti

=e0.00939ti1.5e2.83i-1.6               (M1)

=e0.00939ti3.06249e2.99086i            (A1)(A1)


OR

graph of L or f

p=3.06249...            (A1)

r=-0.150729...  OR  r=2.99086...            (M1)(A1)


Note: The p and r variables (or equivalent) must be seen.


THEN

L(t)=3.06sin(0.00939t+2.99)+12.5                 A1

L(t)=3.06248sin(0.00939t+2.99086)+12.4833


Note: Accept equivalent forms, e.g. L(t)=3.06sin(0.00939t-0.151)+12.5.
Follow through on their answer to part (g) replacing 12.5.

 
[4 marks]

h.ii.

shortest time between sunrise and sunset

12.4833-3.06249               (M1)

=9.42 hours  9.420843                 A1


Note:
Accept 9.44 from use of 3 sf values.

[2 marks]

h.iii.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.i.
[N/A]
c.ii.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
e.
[N/A]
f.
[N/A]
g.
[N/A]
h.i.
[N/A]
h.ii.
[N/A]
h.iii.



This question explores methods to determine the area bounded by an unknown curve.

The curve  y = f ( x )  is shown in the graph, for  0 x 4.4 .

The curve y = f ( x )  passes through the following points.

It is required to find the area bounded by the curve, the  x -axis, the y -axis and the line  x = 4.4 .

One possible model for the curve  y = f ( x )  is a cubic function.

A second possible model for the curve  y = f ( x )  is an exponential function,  y = p e q x , where  p , q R .

Use the trapezoidal rule to find an estimate for the area.

[3]
a.i.

With reference to the shape of the graph, explain whether your answer to part (a)(i) will be an over-estimate or an underestimate of the area.

[2]
a.ii.

Use all the coordinates in the table to find the equation of the least squares cubic regression curve.

[3]
b.i.

Write down the coefficient of determination.

[1]
b.ii.

Write down an expression for the area enclosed by the cubic function, the x -axis, the y -axis and the line x = 4.4 .

[2]
c.i.

Find the value of this area.

[2]
c.ii.

Show that  ln y = q x + ln p .

[2]
d.i.

Hence explain how a straight line graph could be drawn using the coordinates in the table.

[1]
d.ii.

By finding the equation of a suitable regression line, show that  p = 1.83 and  q = 0.986 .

[5]
d.iii.

Hence find the area enclosed by the exponential function, the x -axis, the y -axis and the line x = 4.4 .

[2]
d.iv.

Markscheme

Area  = 1.1 2 ( 2 + 2 ( 5 + 15 + 47 ) + 148 )          M1A1

Area = 156 units2          A1

[3 marks]

a.i.

The graph is concave up,         R1

so the trapezoidal rule will give an overestimate.         A1

[2 marks]

a.ii.

f ( x ) = 3.88 x 3 12.8 x 2 + 14.1 x + 1.54          M1A2

[3 marks]

b.i.

R 2 = 0.999         A1

[1 mark]

b.ii.

Area  = 0 4.4 ( 3.88 x 3 12.8 x 2 + 14.1 x + 1.54 ) d x         A1A1

[2 marks]

c.i.

Area = 145 units2    (Condone 143–145 units2, using rounded values.)      A2

[2 marks]

c.ii.

ln y = ln ( p e q x )       M1

ln y = ln p + ln ( e q x )       A1

ln y = q x + ln p       AG

[2 marks]

d.i.

Plot  ln y against p .      R1

[1 mark]

d.ii.

Regression line is  ln y = 0.986 x + 0.602        M1A1

So  q = gradient = 0.986    R1

p = e 0.602 = 1.83        M1A1

[5 marks]

d.iii.

Area  = 0 4.4 1.83 e 0.986 x d x = 140  units2     M1A1

[2 marks]

d.iv.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.i.
[N/A]
c.ii.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
d.iii.
[N/A]
d.iv.